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Article Info ABSTRACT _ 

In this paper, the authors present an FPGA implementation of a digital delay 
for beamforming applications. The digital delay is based on a Parallel Farrow 
Filter. Such architecture allows to reach a very high processing rate with 
wideband signals and it is suitable to be used with Time-Interleaved Analog 
to Digital Converters (TI-ADC). The proposed delay has been simulated in 
MATLAB, implemented on FPGA and characterized in terms of amplitude 
and phase response, maximum clock frequency and area. 
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1. INTRODUCTION 

The beamforming technique [1-5] is based on the combination of M different signals coming from 
M antennas. Such combination is obtained by delaying and summing the signals in order to produce additive 
interferences in some directions and destructive interferences in others. 

In this application, the delay blocks represent a crucial element. When the beamforming involves 
narrowband signals, the delays can be realized with simple phase shifters, implemented with complex 
multipliers. Vice versa in case of wide-band signals, delay blocks must be implemented using more complex 
circuits [6-7]. A common solution consists in the use of fractional delay filters. These filters are able to 
generate delays, which are a fraction of the system clock cycle. In wideband applications, as for example 
wideband beamforming, the filters must comply with some specifications, namely: 

a. Large bandwidth (ideally Nyquist frequency) 

b. Reduced Magnitude ripple 

c. Reduced Phase ripple 

d. Ideally constant Group delay 

These requirements can be easily met using the Weighted Least-Squares (WLS) method 
implemented with the Larrow architecture [8-13]. However, as the processing rate increases, e.g. more than 1 
GHz, the hardware implementation could present some complications, for example the high sample rate that 
makes impossible the use of LPGAs. The reason is the impossibility of LPGAs to reach processing rates 
beyond the GHz. In fact, although modern Time-Interleaved ADCs [14-16] (TI-ADC) are able to provide 
wide-band signals with sample rates over the GHz, LPGAs are not able to process such signals without 
decimation and, consequently, without reducing the bandwidth of signals involved in processing [17, 18]. 
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In this paper, the authors present an FPGA based digital delay for wideband digital beamforming. 
The proposed solution is able to reach preocessing rates compatible with actual TI-ADC. Such digital delay 
is based on Parallel FIR Filters which are used to compose a Parallel Farrow Filter. 


2. RESEARCH METHOD 

Farrow filters [19-21] represent the most common solution for the implementation of fractional 
delay filters. They are widely discussed in the literature and a detailed analysis is provided in [20]. In Figure 
1 the block diagram of a Farrow filter is provided. The filter is composed of delay blocks, adders, multipliers 
(used for the selection of the delay entity) and M subfilters. Subfilters are represented in the figure with 
blocks named Q(z) with 0<I<M-1. As discussed in [3, 4], modern TI-ADCs for high speed/wide-band 
applications are usually composed of 2 or 4 ADC cores. Each core provides the output on a separate bus. All 
the cores work in parallel and the total sample rate of the ADC is the sum of the sample rates of the 
single cores. 

In other words, the total sample rate is L-f sa where F is the number of cores and f sa is the sample rate 
of each core. The idea of the proposed digital delay is to parallelize the Farrow architecture in order to 
process the incoming parallel data from the TI-ADC cores as shown in Figure 2. This is possible by 
parallelizing the sub-filters that compose the Farrow filter. The parallel architecture is based on polyphase 
filters banks [22, 23]. This technique allows to reduce the operating frequency by parallelizing the filters and, 
as consequence, could be used to reduce the system power consumption by acting on power supply [24]. 

To achieve such parallelization, we introduce a new polyphase architecture that we call Parallel- 
Polyphase. This architecture, differently from traditional polyphase architectures, is able to process data 
without any decimation and consequently without any bandwidth reduction. 



Figure 1. Farrow filter architecture 


Fet's consider the impulse response c^n] of a generic sub-filter Q composing the Farrow filter. In 
order to implement the architecture shown in Figure 2, each sub-filter must be parallelized by a factor equal 
to the number of TI-ADC available cores (Figure 2 shows the case of a TI-ADC composed by 4 cores). In the 
following, we get the equations for a generic parallel filter having F inputs and F outputs, where F is the 
number of the TI-ADC cores. We transform the discrete convolution of a classic SISO (Single Input, Single 
Output) FIR filter into a parallel convolution to model a MIMO (Multi-Input, Multi-Output) FIR Filter. A 
FIR filer is described by the discrete convolution: 

y[n] = Y,iJ 0 lx [n - i] c [i] (!) 

where N is the length of the filter output and c[n] is the impulse response of Q(z). 
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Figure 2. Parallel farrow filter with L=4 


Applying a factor L polyphase decomposition [25] to the output y[n], we split the output in L 
branches yk[n] with k= 0, 1, 2 to L-l: 

y/cN =y[Ln + k]= Y!-~Qx[Ln + k - i]c[i] (2) 

Note that each output y k [n] depends on every sample of the input sequence x[n]. Because TI-ADCs 
provide the inputs in parallel (each core has an independent bus), input x[n] in the equation must be also 
parallelized. This is possible using a change of variables: i=Lm+l with 
0 < 1 < L-l and 0 < m < M-l where M is the length of the subfilters obtained by the polyphase decomposition 
of c[n] [6]. The new equation of the model is shown in (3). 

ydn] = £[=oG]m=o X \- Ln + (k ~ 0 - Lm]c[Lm + /]) (3) 

(3) describes the L outputs y k [n] in function of the L inputs. In terms of Z transform we have: 

Y k (z L ) = Y, L aX k _i(z L )H L (z L ) (4) 

where Z{x[Ln + (k — 1) — Lm]} = X k _j(z L ) are the L parallel inputs provided by the TI-ADC and Z{c[Lm + 
1]} = H^z 1 ) is the subfilter of c[n]. The term X k _ } may have a negative subscript. For this reason, we 
consider p=k-l and we rewrite X k _i with subscript between 0 and L-l: 

X p (z L ) = Z{x[Ln + /?]} = Z{x[L(n - 1)] + (L + /?)} = X L+jS {z L )z~ L (5) 
Consequently: 

k L-l 

Y k (z L )=Y j X K -i(z L )H i (z L )+ ^ X L+(fc _ 0 (z L )^(z L )z- L 

i =0 i=k +1 

( 6 ) 

L-l 

Y L -i{z l ) = Y j X N -i(.z l ) Hi(z L ) 

i =0 
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For example, choosing L=4, a generic filter Ci(z) can be parallelized as shown in Figure 3. The 
polyphase sub-filters Hi(z L ) are computed as shown in [6]. Note that the regularity of such architecture allows 
an easy scalability for any value of L. 



Figure 3. Parallel polyphase decomposition for F=4 


3. RESULTS AND DISCUSSION 

The architecture of Figure 2 has been coded in VHDL and implemented on a XILINX XCVU9P- 
L2FLGA2104E FPGA [26]. After the hardware implementation, the digital delay has been tested injecting 
sinusoids at the input. Keeping a 16-bit resolution in the entire data-path (also inputs are represented with 16 
bits) we obtain the following results: 

a. Magnitude ripple < 0.2 dB. 

b. Phase ripple < 2°. 

c. Minimum delay 10 ps. 

d. Maximum clock frequency (500 MHz) 

The Farrow filter is composed by M=5 subfilters with a length of N=ll. The maximum clock 
frequency of 500 MHz allows the reaching of 2 GSPS using 4 cores. In Figure 4 the magnitude and the phase 
error response in function of the delay are provided. Figure 5 shows examples of delay in the time domain. 


Table 1. Resources utilization 


Resources 

Utilization 

Available 

Utilization% 

LUT 

4,449 

1,182,240 

0.38% 

LUT RAM 

784 

591,840 

0.13% 

FF 

10,735 

2,364,480 

0.45% 

DSP 

80 

6,840 

1.17% 
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WLS: Magnitude (dB) WLS: Phase Error Response (deg) 



Figure 4. Bode diagrams of the proposed digital delay 



Figure 5. Time response with delay 31.25 ps and 62.5 ps 


4. CONCLUSION 

In this paper, a digital delay for beamforming has been presented. The proposed digital delay is 
based on a parallel polyphase decomposition that can be easily implemented on FPGA or ASIC. Thanks to its 
regularity, this parallel polyphase decomposition can be easily generalized for any value of L. The introduced 
digital delay allows FPGAs to process wide-band signals without any decimation and consequently without 
any bandwidth reduction. We have shown an implementation’s example able to process a 2 GSPS signal 
using a TI-ADC with 4 cores and a 500 MHz clock frequency. The signals can be delayed of small quantities 
up to lOps with Magnitude Ripple and Frequency Ripple respectively less than 0.2 dB and 2°. 


REFERENCES 

[1] Steyskal, Hans. “Digital Beamforming Antennas: An Introduction”. Microwave journal , 1987; 30 (1): pp. lOp 
between p 107 and 124 

[2] R. Mucci, "A comparison of efficient beamforming algorithms," in IEEE Transactions on Acoustics, Speech, and 
Signal Processing , vol. 32, no. 3, pp. 548-558, June 1984. 

[3] Malek, N.A., Khalifa, O.O., Abidin, Z.Z., Mohamad, S.Y., Abdul Rahman, N.A. Beam steering using the active 
element pattern of antenna array. Telkomnika (Telecommunication Computing Electronics and Control ), 2018; 
16(4): 1542-1550. 

[4] M. Younis, C. Fischer and W. Wiesbeck, "Digital beamforming in SAR systems," in IEEE Transactions on 
Geoscience and Remote Sensing , vol. 41, no. 7, pp. 1735-1739, July 2003. 

[5] D. E. Dudgeon, "Fundamentals of digital array processing," in Proceedings of the IEEE , vol. 65, no. 6, pp. 898-904, 
June 1977. 


Bulletin of Electr Eng and Inf, Vol. 8, No. 2, June 2019 : 422 - 427 


















Bulletin of Electr Eng and Inf 


ISSN: 2302-9285 


□ 427 


[6] Cardarilli G.C., Di Nunzio L., Fazzolari R., Re M., Rufolo G., Bernocchi G. Analog chain calibration in Digital 
Beam-Forming applications. ARPN Journal of Engineering and Applied Sciences , 2018 13 (2), pp. 752-760. 

[7] C. Cheung, R. Shah and M. Parker, "Time delay digital beamforming for wideband pulsed radar 
implementation," 2013 IEEE International Symposium on Phased Array Systems and Technology , Waltham, MA, 
2013, pp. 448-455. 

[8] C. K. Chu and Yee-Hong Leung, "Further results on the WLS design of variable fractional delay filters," 2012 6th 
International Conference on Signal Processing and Communication Systems , Gold Coast, QLD, 2012, pp. 1-7. 

[9] T. Deng, "Symmetric Structures for Odd-Order Maximally Flat and Weighted-Least-Squares Variable Fractional- 
Delay Filters," in IEEE Transactions on Circuits and Systems I: Regular Papers , vol. 54, no. 12, pp. 2718-2732, 
Dec. 2007. 

[10] Wu-Sheng Lu and Tian-Bo Deng, "An improved weighted least-squares design for variable fractional delay FIR 
filters," in IEEE Transactions on Circuits and Systems II: Analog and Digital Signal Processing , vol. 46, no. 8, pp. 
1035-1040, Aug. 1999. 

[11] Chien-Cheng Tseng, "Design of variable fractional delay allpass filter using weighted least squares method," 2002 
IEEE International Symposium on Circuits and Systems. Proceedings (Cat. No.02CH37353 ), Phoenix-Scottsdale, 
AZ, USA, 2002, pp. V-V. 

[12] S. Tahir, M. Elnamaky, M. A. Ashraf and K. Jamil, "Hardware implementation of digital beamforming network for 
Ultra Wide band signals using uniform linear arrays," The 2nd Middle East Conference on Antennas and 
Propagation , Cairo, 2012, pp. 1-4. 

[13] Cardarilli G.C., Giardino D., Matta M., Re M., Silvestri F., Simone L., Spano S. “Comparison and Implementation 
of Variable Fractional Delay Filters for Wideband Digital Beamforming”. Lecture Notes in Electrical Engineering, 
Article In Press 2019 

[14] F. Harris, Xiaofei Chen, E. Venosa and F. A. N. Palmieri, "Two channel TI-ADC for communication signals," 2011 
IEEE 12th International Workshop on Signal Processing Advances in Wireless Communications , San Francisco, 
CA, 201 l,pp. 576-580. 

[15] G. Manganaro and D. Robertson Interleaving ADCs: Unraveling the Mysteries https://www.analog.com/en/analog- 
dialogue/articles/interleaving-adcs 

[16] Interleaving ADCs for Higher Sample Rates Literature Number. SNAA111 
http://www.ti.com/lit/wp/snaal 11/snaal 1 l.pdf 

[17] Cappello S., Cardarilli G.C., Di Nunzio L., Fazzolari R., Re M., Albicocco P. “Flexible channel extractor for 
wideband systems based on polyphase filter bank Journal of Theoretical and Applied Information Technology , 
2017; 95(16): 3841-3850. 

[18] Cardarilli G.C., Di Nunzio L., Fazzolari R., Re M., Nannarelli A. “Power Efficient Digital Front-End for Cognitive 
Radio Systems”. Conference Record of the IEEE Asilomar Conference on Signals , Systems and Computers , 2018 

[19] V. Valimaki and T. I. Laakso, "Principles of fractional delay filters," 2000 IEEE International Conference on 
Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100), Istanbul, Turkey, 2000, pp. 3870- 
3873 vol.6. 

[20] Laakso T.I., Valimaki V., Karjalainen M., Laine U.K.:”Splitting the unit: Delay tools for fractional delay filter 
design”. IEEE Signal Processing Magazine , 1996; 13(1), pp. 30-60. 

[21] C. K. S. Pun, Y. C. Wu, S. C. Chan and K. L. Ho, "An efficient design or fractional-delay digital FIR filters using 
the Farrow structure," Proceedings of the 11th IEEE Signal Processing Workshop on Statistical Signal Processing 
(Cat. No.01TH8563), Singapore, 2001, pp. 595-598. 

[22] P. P. Vaidyanathan, "Multirate digital filters, filter banks, polyphase networks, and applications: a tutorial," 
in Proceedings of the IEEE , vol. 78, no. 1, pp. 56-93, Jan. 1990. 

[23] Cardarilli, G.C., Di Nunzio, L., Fazzolari, R., Fereidountabar, A., Giuliani, F., Re, M., Simone, L. “Comparison of 
jamming excision methods for direct sequence/spread spectrum (DS/SS) modulated signal”, Journal of Theoretical 
and Applied Information Technology, 2017; 95(13): 2878-2888. 

[24] Cardarilli, G.C., Di Nunzio, L., Fazzolari, R., Re, M., Silvestri, F., Spano, S. “Energy consumption saving in 
embedded microprocessors using hardware accelerators”. TELKOMNIKA (Telecommunication, Computing, 
Electronics and Control ). (2018); 16(3): 1019-1026. 

[25] Mitra S.K., and Yonghong K. Digital signal processing: a computer-based approach. Vol. 2. New York: McGraw- 
Hill, 2006. 

[26] UltraScale Architecture Staying a Generation Ahead with an Extra Node of Value. 
https://www.xilinx.com/products/technology/ultrascale.html 


Efficient FPGA implementation of high speed digital delay for wideband... (Gian Carlo Cardarilli) 



